NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A discretization-invariant extension and analysis of some deep operator networks

https://doi.org/10.1016/j.cam.2024.116226

Zhang, Zecheng; Leung, Wing Tat; Schaeffer, Hayden (March 2025, Journal of Computational and Applied Mathematics)

Free, publicly-accessible full text available March 1, 2026
TIME-SERIES FORECASTING AND REFINEMENT WITHIN A MULTIMODAL PDE FOUNDATION MODEL

https://doi.org/10.1615/JMachLearnModelComput.2025057618

Jollie, Derek; Sun, Jingmin; Zhang, Zecheng; Schaeffer, Hayden (January 2025, Journal of Machine Learning for Modeling and Computing)

Symbolic encoding has been used in multioperator learning (MOL) as a way to embed additional information for distinct time-series data. For spatiotemporal systems described by time-dependent partial differential equations (PDEs), the equation itself provides an additional modality to identify the system. The utilization of symbolic expressions alongside time-series samples allows for the development of multimodal predictive neural networks. A key challenge with current approaches is that the symbolic information, i.e., the equations, must be manually preprocessed (simplified, rearranged, etc.) to match and relate to the existing token library, which increases costs and reduces flexibility, especially when dealing with new differential equations. We propose a new token library based on SymPy to encode differential equations as an additional modality for time-series models. The proposed approach incurs minimal cost, is automated, and maintains high prediction accuracy for forecasting tasks. Additionally, we include a Bayesian filtering module that connects the different modalities to refine the learned equation. This improves the accuracy of the learned symbolic representation and the predicted time-series.
more » « less
Full Text Available
Bayesian Deep Operator Learning for Homogenized to Fine-Scale Maps for Multiscale PDE

https://doi.org/10.1137/23M160342X

Zhang, Zecheng; Moya, Christian; Leung, Wing Tat; Lin, Guang; Schaeffer, Hayden (September 2024, Multiscale Modeling & Simulation)

Full Text Available
D2NO: Efficient handling of heterogeneous input function spaces with distributed deep neural operators

https://doi.org/10.1016/j.cma.2024.117084

Zhang, Zecheng; Moya, Christian; Lu, Lu; Lin, Guang; Schaeffer, Hayden (August 2024, Computer Methods in Applied Mechanics and Engineering)

Full Text Available
SRMD: Sparse Random Mode Decomposition

https://doi.org/10.1007/s42967-023-00273-x

Richardson, Nicholas; Schaeffer, Hayden; Tran, Giang (June 2024, Communications on Applied Mathematics and Computation)

Full Text Available
Conditioning of random Fourier feature matrices: double descent and generalization error

https://doi.org/10.1093/imaiai/iaad054

Chen, Zhijun; Schaeffer, Hayden (April 2024, Information and Inference: A Journal of the IMA)

Abstract We provide high-probability bounds on the condition number of random feature matrices. In particular, we show that if the complexity ratio $N/m$, where $$N$$ is the number of neurons and $$m$$ is the number of data samples, scales like $$\log ^{-1}(N)$$ or $$\log (m)$$, then the random feature matrix is well-conditioned. This result holds without the need of regularization and relies on establishing various concentration bounds between dependent components of the random feature matrix. Additionally, we derive bounds on the restricted isometry constant of the random feature matrix. We also derive an upper bound for the risk associated with regression problems using a random feature matrix. This upper bound exhibits the double descent phenomenon and indicates that this is an effect of the double descent behaviour of the condition number. The risk bounds include the underparameterized setting using the least squares problem and the overparameterized setting where using either the minimum norm interpolation problem or a sparse regression problem. For the noiseless least squares or sparse regression cases, we show that the risk decreases as $$m$$ and $$N$$ increase. The risk bound matches the optimal scaling in the literature and the constants in our results are explicit and independent of the dimension of the data.
more » « less
Full Text Available
HARFE: hard-ridge random feature expansion

https://doi.org/10.1007/s43670-023-00063-9

Saha, Esha; Schaeffer, Hayden; Tran, Giang (December 2023, Sampling Theory, Signal Processing, and Data Analysis)

Full Text Available
Random feature models for learning interacting dynamical systems

https://doi.org/10.1098/rspa.2022.0835

Liu, Yuxuan; McCalla, Scott G.; Schaeffer, Hayden (July 2023, Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences)

Particle dynamics and multi-agent systems provide accurate dynamical models for studying and forecasting the behaviour of complex interacting systems. They often take the form of a high-dimensional system of differential equations parameterized by an interaction kernel that models the underlying attractive or repulsive forces between agents. We consider the problem of constructing a data-based approximation of the interacting forces directly from noisy observations of the paths of the agents in time. The learned interaction kernels are then used to predict the agents’ behaviour over a longer time interval. The approximation developed in this work uses a randomized feature algorithm and a sparse randomized feature approach. Sparsity-promoting regression provides a mechanism for pruning the randomly generated features which was observed to be beneficial when one has limited data, in particular, leading to less overfitting than other approaches. In addition, imposing sparsity reduces the kernel evaluation cost which significantly lowers the simulation cost for forecasting the multi-agent systems. Our method is applied to various examples, including first-order systems with homogeneous and heterogeneous interactions, second-order homogeneous systems, and a new sheep swarming system.
more » « less
Full Text Available
Generalization bounds for sparse random feature expansions

https://doi.org/10.1016/j.acha.2022.08.003

Hashemi, Abolfazl; Schaeffer, Hayden; Shi, Robert; Topcu, Ufuk; Tran, Giang; Ward, Rachel (January 2023, Applied and Computational Harmonic Analysis)

Full Text Available
Reduced order modeling using shallow relu networks with grassmann layers

Bollinger, Kayla; Schaeffer, Hayden (January 2022, Proceedings of Machine Learning Research 2021 2nd Annual Conference on Mathematical and Scientific Machine Learning)

Full Text Available

« Prev Next »

Search for: All records